NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images

Naseh, Ali; Thai, Katherine; Iyyer, Mohit; Houmansadr, Amir (October 2024, COLM 2024)

Full Text Available
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images

Naseh, Ali; Thai, Katherine; Iyyer, Mohit; Houmansadr, Amir (October 2024, COLM)

Full Text Available
Stealing the Decoding Algorithms of Language Models

https://doi.org/10.1145/3576915.3616652

Naseh, Ali; Krishna, Kalpesh; Iyyer, Mohit; Houmansadr, Amir (November 2023, ACM)

Full Text Available
A Critical Evaluation of Evaluations for Long-form Question Answering

https://doi.org/10.18653/v1/2023.acl-long.181

Xu, Fangyuan; Song, Yixiao; Iyyer, Mohit; Choi, Eunsol (January 2023, Association for Computational Linguistics)

Full Text Available
ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution

Gupta, Ankita; Karpinska, Marzena; Zhao, Wenlong; Krishna, Kalpesh; Merullo, Jack; Yeh, Luke; Iyyer, Mohit; O'Connor, Brendan (May 2023, Findings of the Association for Computational Linguistics: EACL 2023)
Vlachos, Andreas; Augenstein, Isabelle (Ed.)
Large-scale, high-quality corpora are critical for advancing research in coreference resolution. However, existing datasets vary in their definition of coreferences and have been collected via complex and lengthy guidelines that are curated for linguistic experts. These concerns have sparked a growing interest among researchers to curate a unified set of guidelines suitable for annotators with various backgrounds. In this work, we develop a crowdsourcing-friendly coreference annotation methodology, ezCoref, consisting of an annotation tool and an interactive tutorial. We use ezCoref to re-annotate 240 passages from seven existing English coreference datasets (spanning fiction, news, and multiple other domains) while teaching annotators only cases that are treated similarly across these datasets. Surprisingly, we find that reasonable quality annotations were already achievable (90% agreement between the crowd and expert annotations) even without extensive training. On carefully analyzing the remaining disagreements, we identify the presence of linguistic cases that our annotators unanimously agree upon but lack unified treatments (e.g., generic pronouns, appositives) in existing datasets. We propose the research community should revisit these phenomena when curating future unified annotation guidelines.
more » « less
Full Text Available
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation

https://doi.org/10.18653/v1/2023.emnlp-main.741

Min, Sewon; Krishna, Kalpesh; Lyu, Xinxi; Lewis, Mike; Yih, Wen-tau; Koh, Pang; Iyyer, Mohit; Zettlemoyer, Luke; Hajishirzi, Hannaneh (January 2023, Association for Computational Linguistics)

Full Text Available
SLING: Sino Linguistic Evaluation of Large Language Models

https://doi.org/10.18653/v1/2022.emnlp-main.305

Song, Yixiao; Krishna, Kalpesh; Bhatt, Rajesh; Iyyer, Mohit (January 2022, Empirical Methods in Natural Language Processing)

Full Text Available
RankGen: Improving Text Generation with Large Ranking Models

https://doi.org/10.18653/v1/2022.emnlp-main.15

Krishna, Kalpesh; Chang, Yapei; Wieting, John; Iyyer, Mohit (January 2022, Empirical Methods in Natural Language Processing)

Full Text Available
Revisiting Simple Neural Probabilistic Language Models

https://doi.org/10.18653/v1/2021.naacl-main.407

Sun, Simeng; Iyyer, Mohit (June 2021, Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)
null (Ed.)
Recent progress in language modeling has been driven not only by advances in neural architectures, but also through hardware and optimization improvements. In this paper, we revisit the neural probabilistic language model (NPLM) of Bengio et al. (2003), which simply concatenates word embeddings within a fixed window and passes the result through a feed-forward network to predict the next word. When scaled up to modern hardware, this model (despite its many limitations) performs much better than expected on word-level language model benchmarks. Our analysis reveals that the NPLM achieves lower perplexity than a baseline Transformer with short input contexts but struggles to handle long-term dependencies. Inspired by this result, we modify the Transformer by replacing its first self-attention layer with the NPLM’s local concatenation layer, which results in small but consistent perplexity decreases across three word-level language modeling datasets.
more » « less
Full Text Available
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation

https://doi.org/10.18653/v1/2022.emnlp-main.630

Vu, Tu; Barua, Aditya; Lester, Brian; Cer, Daniel; Iyyer, Mohit; Constant, Noah (January 2022, Empirical Methods in Natural Language Processing)

Full Text Available

« Prev Next »

Search for: All records